A Study of Phoneme and Syllable Duration Characteristics of Mandarin Chinese

نویسندگان

  • Weizhong ZHU
  • Kenji MATSUI
چکیده

The multiple regression model was used to study the phoneme and syllable duration characteristics of mandarin Chinese. The source speech material is a phonetically balanced text corpus collected from newspapers and spoken by a professional female announcer. Since the syllable, in an Initial/Final format, was adopted as a basic synthesis unit in our Chinese TTS system, the investigations were taken on both Initial/Final and syllable bases. RMS error values of the model are 18.6, 36.9 and 43.1 ms for Initial, Final, and syllable, respectively. The results are quite close to those reported in literature, which may use different approaches, such as neural networks. In the multiple regression model, an interesting finding is that the factor of the following syllable is much larger than that of the preceding syllable. This evidence is further discussed by focusing into two-syllable words in the utterances. From our informal listening tests, we confirmed that this approach improves the naturalness of synthetic speech as compared to our previous rule-bases duration model.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Constructing initial phonology in Mandarin Chinese: Syllabic or subsyllabic? A masked priming investigation

Recent research has put forward the idea that Chinese speech production is governed by the syllable as the fundamental phonological unit. However, it may be that onset priming might be more difficult to obtain in Mandarin Chinese. Therefore, in this study, the degree of overlap between prime and target was increased from C to CV (i.e., extending beyond the phoneme) as well as whether primes and...

متن کامل

Perception of English syllable-final consonants by Chinese speakers and Japanese speakers

This study investigates perception by Mandarin Chinese and Japanese native speakers of English consonants in syllable-final position, and the effect of vowel duration as a cue to voicing in syllablefinal stops. Two experiments were conducted in the study and the results revealed that Mandarin Chinese speakers performed better than Japanese speakers but the position of the consonant in the sylla...

متن کامل

A Corpus Study of the Prosody of Polysyllabic Words in Mandarin Chinese

This paper presents a corpus study of polysyllabic words in Standard Mandarin Chinese. In particular, this study investigates their prosodic features with respect to the notions of prosodic strength and stress. We find a robust strong-weak alternation with respect to F0, but different patterns for duration. In disyllabic words the first syllable tends be slightly longer than the second. However...

متن کامل

Computation of L2 Speech Rhythm Based on Duration and Fundamental Frequency

Rhythmic characteristics of speech vary between native and non-native speakers. Studies comparing the rhythmic properties of L1 and L2 speech based on rhythm metrics have shown that this relationship is far from straightforward. It seems evidently the case that the difference between native and non-native speech is a complex interaction of a variety of rhythmic cues (duration, F0 and intensity)...

متن کامل

The Role of Phoneme in Mandarin Chinese Production: Evidence from ERPs

Established linguistic theoretical frameworks propose that alphabetic language speakers use phonemes as phonological encoding units during speech production whereas Mandarin Chinese speakers use syllables. This framework was challenged by recent neural evidence of facilitation induced by overlapping initial phonemes, raising the possibility that phonemes also contribute to the phonological enco...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000